Threadpool merge scheduler #120869

albertzaharovits · 2025-01-26T13:59:14Z

This adds a new merge scheduler implementation that uses a (new) dedicated thread pool to run the merges. This way the number of concurrent merges is limited to the number of threads in the pool (i.e. the number of allocated processors to the ES JVM).

It implements dynamic IO throttling (the same target IO rate for all merges, roughly, with caveats) that's adjusted based on the number of currently active (queued + running) merges.
Smaller merges are always preferred to larger ones, irrespective of the index shard that they're coming from.
The implementation also supports the per-shard "max thread count" and "max merge count" settings, the later being used today for indexing throttling.
Note that IO throttling, max merge count, and max thread count work similarly, but not identical, to their siblings in the ConcurrentMergeScheduler.

The per-shard merge statistics are not affected, and the thread-pool statistics should reflect the merge ones (i.e. the completed thread pool stats reflects the total number of merges, across shards, per node).

This adds a new merge scheduler implementation that uses a (new) dedicated thread pool to run the merges. This way the number of concurrent merges is limited to the number of threads in the pool (i.e. the number of allocated processors to the ES JVM). It implements dynamic IO throttling (the same target IO rate for all merges, roughly, with caveats) that's adjusted based on the number of currently active (queued + running) merges. Smaller merges are always preferred to larger ones, irrespective of the index shard that they're coming from. The implementation also supports the per-shard "max thread count" and "max merge count" settings, the later being used today for indexing throttling. Note that IO throttling, max merge count, and max thread count work similarly, but not identical, to their siblings in the ConcurrentMergeScheduler. The per-shard merge statistics are not affected, and the thread-pool statistics should reflect the merge ones (i.e. the completed thread pool stats reflects the total number of merges, across shards, per node).

…ep up with the merge load (elastic#125654) Fixes an issue where indexing throttling kicks in while disk IO is throttling. Instead disk IO should first unthrottle, and only then, if we still can't keep up with the merging load, start throttling indexing. Fixes elastic/elasticsearch-benchmarks#2437 Relates elastic#120869

The intent here is to aim for fewer to-do merges enqueued for execution, and to unthrottle disk IO at a faster rate when the queue grows longer. Overall this results in less merge disk throttling. Relates elastic/elasticsearch-benchmarks#2437 elastic#120869

Fixes elastic#125639 Relates elastic#120869

…gMergeTasks (#126058) Fixes #125842 Relates #120869

…gMergeTasks (elastic#126058) Fixes elastic#125842 Relates elastic#120869

…gMergeTasks (#126058) (#129516) Fixes #125842 Relates #120869

…gMergeTasks (#126058) (#129515) Fixes #125842 Relates #120869

This deprecates the `indices.merge.scheduler.use_thread_pool` setting that was introduced in #120869 because this setting should not normally be used, unless instructed so by engineering to get around temporary issues with the new threadpool-based merge scheduler.

…9464) This deprecates the `indices.merge.scheduler.use_thread_pool` setting that was introduced in elastic#120869 because this setting should not normally be used, unless instructed so by engineering to get around temporary issues with the new threadpool-based merge scheduler.

…129629) This deprecates the `indices.merge.scheduler.use_thread_pool` setting that was introduced in #120869 because this setting should not normally be used, unless instructed so by engineering to get around temporary issues with the new threadpool-based merge scheduler.

…9464) (#129628) * Deprecate indices.merge.scheduler.use_thread_pool setting (#129464) This deprecates the `indices.merge.scheduler.use_thread_pool` setting that was introduced in #120869 because this setting should not normally be used, unless instructed so by engineering to get around temporary issues with the new threadpool-based merge scheduler. * Update warning msg

…9464) This deprecates the `indices.merge.scheduler.use_thread_pool` setting that was introduced in elastic#120869 because this setting should not normally be used, unless instructed so by engineering to get around temporary issues with the new threadpool-based merge scheduler.

This documents the new threadpool-based merge scheduler, which is disk space aware, and blocks merges when disk space is low. The code changes were mostly introduced in #120869 and #127613 .

This documents the new threadpool-based merge scheduler, which is disk space aware, and blocks merges when disk space is low. The code changes were mostly introduced in elastic#120869 and elastic#127613 .

This documents the new threadpool-based merge scheduler, which is disk space aware, and blocks merges when disk space is low. The code changes were mostly introduced in #120869 and #127613 .

…k space aware, and blocks merges when disk space is low. The code changes were mostly introduced in elastic#120869 and elastic#127613 .

…k space aware, and blocks merges when disk space is low. The code changes were mostly introduced in #120869 and #127613 . (#130530)

This documents the new threadpool-based merge scheduler, which is disk space aware, and blocks merges when disk space is low. The code changes were mostly introduced in elastic#120869 and elastic#127613 .

albertzaharovits and others added 30 commits January 16, 2025 17:27

ExecutorMergeScheduler

bf557d2

Merge branch 'main' into threadpool-merge-scheduler

a3f87df

[CI] Auto commit changes from spotless

f5a1a8d

wrap for merge in the executor merge scheduler

f0b72fe

spotless

9b03950

Merge branch 'main' into threadpool-merge-scheduler

26e4043

Fix InternalEngineTests

aba69d0

Merge branch 'main' into threadpool-merge-scheduler

52796b5

implemented Throttling

c0667bf

Merge branch 'main' into threadpool-merge-scheduler

2da753f

[CI] Auto commit changes from spotless

2c8dc7f

Checkstyle

81cc0f1

Fix threadpool size for SnapshotResiliencyTests

f58120f

Spotless

5ca992d

Nit

3c203cb

Implemented max thread setting

6c21654

Throttling ?

68079d9

Checkstyle

7b68ba9

Indexing throttling !

9e467a1

Better throttling logging

a8f5297

Merge branch 'main' into threadpool-merge-scheduler

928fd32

Don't wrap errors during merging

3f5b4a8

Merge branch 'main' into threadpool-merge-scheduler

0e714a1

Merge branch 'main' into threadpool-merge-scheduler

0297cce

Refresh config

2b79809

Nit

57c2a5c

WIP

60a71b8

Merge branch 'main' into threadpool-merge-scheduler-sort-all-merges

68db209

IO throttling

4099ac5

Merge branch 'main' into threadpool-merge-scheduler-sort-all-merges

5554bc2

This was referenced Jun 9, 2025

WIP Threadpool merge scheduler #120293

Closed

WIP Threadpool merge scheduler sort all merges #120733

Closed

albertzaharovits added a commit to albertzaharovits/elasticsearch that referenced this pull request Jun 9, 2025

Fix testMergeSourceWithFollowUpMergesRunSequentially (elastic#126050)

c43805e

Fixes elastic#125639 Relates elastic#120869

This was referenced Jun 9, 2025

[9.0] New threadpool-based merge scheduler which is disk space aware #129134

Merged

[8.19] New threadpool-based merge scheduler which is disk space aware #129152

Merged

Deprecate indices.merge.scheduler.use_thread_pool setting #129464

Merged

elasticsearchmachine pushed a commit that referenced this pull request Jun 16, 2025

Fix ThreadPoolMergeExecutorServiceTests testIORateIsAdjustedForRunnin…

d98327c

…gMergeTasks (#126058) Fixes #125842 Relates #120869

albertzaharovits added a commit to albertzaharovits/elasticsearch that referenced this pull request Jun 17, 2025

Fix ThreadPoolMergeExecutorServiceTests testIORateIsAdjustedForRunnin…

4b01793

…gMergeTasks (elastic#126058) Fixes elastic#125842 Relates elastic#120869

albertzaharovits added a commit to albertzaharovits/elasticsearch that referenced this pull request Jun 17, 2025

Fix ThreadPoolMergeExecutorServiceTests testIORateIsAdjustedForRunnin…

94fb9de

…gMergeTasks (elastic#126058) Fixes elastic#125842 Relates elastic#120869

albertzaharovits added a commit that referenced this pull request Jun 17, 2025

Fix ThreadPoolMergeExecutorServiceTests testIORateIsAdjustedForRunnin…

12411fd

…gMergeTasks (#126058) (#129516) Fixes #125842 Relates #120869

albertzaharovits added a commit that referenced this pull request Jun 17, 2025

Fix ThreadPoolMergeExecutorServiceTests testIORateIsAdjustedForRunnin…

eb22fa1

…gMergeTasks (#126058) (#129515) Fixes #125842 Relates #120869

albertzaharovits mentioned this pull request Jul 2, 2025

[DOCS] Disk space aware threadpool merge scheduler #130465

Merged

albertzaharovits mentioned this pull request Jul 3, 2025

[8.19] [DOCS] Disk space aware threadpool merge scheduler #130530

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Threadpool merge scheduler #120869

Threadpool merge scheduler #120869

Uh oh!

albertzaharovits commented Jan 26, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Threadpool merge scheduler #120869

Threadpool merge scheduler #120869

Uh oh!

Conversation

albertzaharovits commented Jan 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

albertzaharovits commented Jan 26, 2025 •

edited

Loading